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DETAILED ACTION 

1. Claims 1-29 have been examined. 



Papers Submitted 

2. It is hereby acknowledged that the following papers have been received and placed of 
record in the file: RCE and Amendment as received on 3/1 1/2005. 



Claim Objections 

3. Claim 18 is objected to because of the following informalities: The paragraph beginning 
with "from" should not be separated from the previous paragraph. Please combine them 
together. Also, for increase clarity, it is suggested that applicant insert —for— before 
"simultaneously" and "independently". Appropriate correction is required. 

4. Claim 23 is objected to because of the following informalities: Please replace 
"synchronous" in line 6 with —synchronously—. Appropriate correction is required. 

Claim Rejections - 35 USC § 112 

5. The following is a quotation of the second paragraph of 35 U.S.C. 1 12: 

The specification shall conclude with one or more claims particularly pointing out and distinctly claiming the 
subject matter which the applicant regards as his invention. 

6. Claim 23 is rejected under 35 U.S.C. 1 12, second paragraph, as being indefinite for 
failing to particularly point out and distinctly claim the subject matter which applicant regards as 
the invention. More specifically, it appears as if applicant left the end of claim 23 unfinished. It 
is unclear as to what applicant is trying to say in this portion of the claim. Consequently, the last 
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portion of the claim ("wherein the synchronous driving and the independent driving.") will be 
ignored for purposes of this examination. 

Claim Rejections - 35 USC §102 

7. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that form the 
basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(b) the invention was patented or described in a printed publication in this or a foreign country or in pubhc use or on 
sale in this country, more than one year prior to the date of appUcation for patent in the United States. 

(e) the invention was described in (1) an application for patent, published under section 122(b), by another filed 
in the United States before the invention by the applicant for patent or (2) a patent granted on an application for 
patent by another filed in the United States before the invention by the applicant for patent, except that an 
international application filed under the treaty defined in section 351(a) shall have the effects for purposes of this 
subsection of an application filed in the United States only if the international application designated the United 
States and was published under Article 21(2) of such treaty in the English language. 

8. Claims 1-6 and 8 are rejected under 35 U.S.C. 102(b) as being anticipated by Parady, 
U.S. Patent No. 5,933,627 (as apphed in the previous Office Action). 

9. Referring to claim 1, Parady has taught a processor control apparatus for controlling a 
pluraUty of arithmetic units, said processor control apparatus comprising a plurality of 
instruction control units (Fig.3, component 1 12 and components 102-108, which correspond to 
components 154 in Fig. 5) issuing a series of instructions to said plurality of arithmetic units 
(Fig.5, components 156, 160, 168, and 170), wherein at least one of said instruction control units 
switches between a first execution process driving said plurality of arithmetic units by a single 
series of instructions issued from one of the plurality of instruction control units and a second 
execution process correspondingly driving said plurality of arithmetic units by a plurality of 
different series of instructions issued respectively from said plurality of instruction control units. 
See the abstract and note that while instructions belonging to a first execution process may be 
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issued from a single instruction control unit (Fig. 2, component 102, for instance), a long latency 
event (or some other event) within that process will result m switching to driving arithmetic units 
using a second set of instructions belonging to a second execution process. The second 
execution process is interpreted as being the overall process that is not the first execution 
process. So, series of instructions from buffers 104, 106, and 108 will ultimately be executed, 
which in turn drive the arithmetic units. 

10. Referring to claim 2, Parady has taught a processor control apparatus as described in 
claim 1 . Parady has further taught that said at least one instruction control unit each perform a 
switching process switching between said first execution process and said second execution 
process according to information which is contained in advance in a series of instructions. See 
the abstract, column 3, lines 57-64. Note that execution processes are switched due to multiple 
events such as a load instruction missing the data cache. The load instruction is contained in 
advance in a series of instructions. Also, other supported instructions are jumps to threads. See 
column 4, line 63, to column 5, Une 5. These instructions are also considered information 
contained in advance in the series of instruction. 

1 1 . Referring to claim 3, Parady has taught a processor control apparatus as described in 
claim 1 . Parady has further taught that when an M-th one of said instruction control units issues 
a second series of instructions to an N-th one of said arithmetic units which is performing said 
second execution process based on a first series of instructions issued by an N-th one of said 
instruction control units different from said M-th instruction control unit, said M-th instruction 
control unit is set in a wait state until said N-th arithmetic unit completes said second execution 
process. See column 4, lines 42-52, and note that if the second execution process is of highest 
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priority (or at least higher priority than the process corresponding to the second series of 
instruction), then the M-th control unit will have to wait to issue instructions until the second 
execution process completes. Also, it should be reaUzed that there are M instruction control 
units (Fig.3, components 102-108, and Fig.5, components 154) and N functional units (Fig.l and 
Fig.5). 

1 2. Referring to claim 4, Parady has taught a processor control apparatus as described in 
claim 1. Parady has further taught a first storage element for holding a plurality of series of 
instructions, wherein when an M-th one of said instruction control units issues a second series of 
instructions to an N-th one of said arithmetic units which is performing said second execution 
process based on a first series of instructions issued by an N-th one of said instruction control 
units different from said M-th instruction control unit, said second series of instructions from 
said M-th instruction control unit are stored in said first storage element, and wherein said N-th 
arithmetic unit executes instructions which are stored in said first storage element based on 
information contained in said first series of instructions issued by said N-th instruction control 
unit. See Fig.3, for instance, and note the instruction control unit (say 104 in combination with 
28), comprises a first storage element to hold thread instructions. If the thread 0 control unit 
(102 in combination with 28) issues a load that misses the cache or a jump to thread instruction, 
and the thread 1 control unit is the next to take control, then an Nth arithmetic unit will execute 
the instructions of thread 1 based on information contained in advance in the first series of 
instructions (i.e., the load or jump). 

13. Referring to claim 5, Parady has taught a processor control apparatus as described in 
claim 1 . Parady has further taught a second storage element which operates to hold, when one of 
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said arithmetic units executing a first series of instructions from one of said instruction control 
units is switched to execute a second series of instructions from another instruction control unit, 
data generated by the second series of instructions under execution by associating the data with 
that instruction control unit which is executing the second series of instructions. See Fig. 3, 
components 48 and 50. Note that a register file exists for each thread to store data generated 
during execution of each thread, respectively. 

14. Referring to claim 6, Parady has taught a processor control apparatus as described in 
claim 1 . Parady has further taught that it is determined, based on an instruction executing state 
of each arithmetic unit, one of said arithmetic units to which a new series of instructions is to be 
issued by one of said instruction control units, and wherein said one instruction control unit is 
controlled based on the result of the determination so that the new series of instructions are 
directed to said one arithmetic unit thus determined. More specifically, when the arithmetic units 
are idle (for instance, in the case of a cache miss), a switch will occur, causing the new series of 
instructions to be directed to a particular (or more than one) arithmetic unit. For instance, if the 
thread 1 control unit is the next to issue instructions, and the instructions are all floating point 
multiplication instructions, then they will all be directed to the FP multiply unit 40 shown in 
Fig.l. 

15. Referring to claim 8, Parady has taught a processor control apparatus as described in 
claim 1 . Parady has further taught that each of said series of instructions includes a series of 
time sharing instructions for serially driving a plurality of ones of said arithmetic units. In one 
embodiment of Parady, threads are switched based on a round-robin counter (Fig. 3, component 
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128), which is used to switch between threads at specified time intervals. In essence, the 
instructions in a series all share an amount of time before that thread (series) is switched out. 

16. Claims 1-2, 8-9, 17, 23, 25, and 28-29 are rejected under 35 U.S.C. 102(e) as being 
anticipated by Fernando et al., U.S. Patent No. 6,272,616 (herein referred to as Fernando). 

17. Referring to claim 1, Fernando has taught a processor control apparatus for controlling a 
plurality of arithmetic units (Fig.l, components 26 in component 24b), said processor control 
apparatus comprising a plurality of instruction control units (Fig. 1) issuing a series of 
instructions to said plurality of arithmetic units, wherein at least one of said instruction control 
units switches between a first execution process driving said plurality of arithmetic units by a 
single series of instructions issued from one of the plurality of instruction control units and a 
second execution process correspondingly driving said plurality of arithmetic units by a plurality 
of different series of instructions issued respectively from said plurality of instruction control 
units. See Fig. 1 and notice that at between times A and B, a first process would include 
executing instructions fetched solely by fetcher 20a. Multiplexer 21 would select these 
instructions which would drive units 26 in component 24b. Moreover, between times X and Y, a 
second process would include executing instructions fetched by both fetchers 20a and 20b. 
Multiplexer would select from both sources which would then result in driving units 26 in 
component 24b. 

18. Referring to claim 2, Fernando has taught a processor control apparatus as described in 
claim 1 . Fernando has further taught that said at least one instruction control unit each perform a 
switching process for switching between said first execution process and said second execution 
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process according to information which is contained in advance in a series of instructions. Note 
that the multiplexer 21 (Fig, 1) is what causes a switch in instruction stream execution. This 
switch is caused by a change in signal 32i, which is in response to an instruction (CFORK or 
DFORK) contained in advance in a first instruction stream. See column 5, Hnes 8-16. 

19. Referring to claim 8, Fernando has taught a processor control apparatus as described in 
claim 1 . Fernando has further taught that each of said series of instructions includes a series of 
time-sharing instructions for serially driving a plurality of ones of said arithmetic units. For 
instance, looking at Fig.7 of Fernando, the series of instructions corresponding to thread 2 
execute from time 2 to time 7. Therefore, the thread takes 6 time units to execute and each 
individual instruction requires some portion of the overall 6 time units for execution. Therefore, 
each instruction is a time-sharing instruction, i.e., each instruction shares the overall 6 time units 
with the other instructions. 

20. Referring to claim 9, Fernando has taught a processor control apparatus as described in 
claim 1. Fernando has further taught power control elements for controlling power supply to 
said arithmetic units based on their instruction executing states. See the abstract and column 8, 
lines 10-12, and note that unused processing elements are deactivated in order to conserve 
power. 

21 . Referring to claim 17, Fernando has taught a processor control apparatus comprising: 

a) a single instruction memory for storing a plurality of series of instructions to be executed by a 
plurality of arithmetic units. See Fig. 1, components 12 and 26, Fig.7 shows multiple series of 
instructions being executed by different units. 



Application/Control Number: 09/855,776 Page 9 

Art Unit: 2183 

b) an instruction decoder for decoding a series of instructions from said instruction memory, and 
outputting a decoded result to any of said plurality of arithmetic units. See Fig. 1, component 22a 
and 22b. 

c) a selector selectively switching between a plurality of series of instructions from said 
instruction memory to be decoded by said instruction decoder, and supplying a series of 
instructions thus selected to said instruction decoder. See Fig.l, component 21, and claim 44 of 
Fernando. 

d) wherein said instruction memory has a plurality of ports for issuing said series of instructions 
to said instruction decoder. See Fig. 1 component 12 and note that multiple ports are connected 
to multiple fetch units and ultimately to multiple decoders. 

e) wherein said plurality of series of instructions are issued to said plurality of arithmetic units 
from the instruction memory to be simultaneously and independently driven. Looking at Fig.l, it 
can be seen that when multiplexer 21 selects instructions from fetcher 20a, then the units 26 in 
components 24a and 24b will be simultaneously driven (by a first instruction stream). This is 
shown more specifically in Fig,3. Note that this is the purpose of SIMD instructions: to have 
multiple units execute the same instruction with multiple data items. On the other hand, looking 
at Fig. 1, it can be seen that when multiplexer 2 1 selects instructions from fetcher 20b, then the 
units 26 in components 24a and 24b will be independently driven (the units in 24a will be driven 
by a first instruction stream from fetcher 20a while units in 24b will be driven by a second 
instruction stream from fetcher 20b). Note that this is the purpose of MIMD instructions: to have 
multiple units execute the multiple instructions with multiple data items, 

22. Referring to claim 23, Fernando has taught a processor comprising: 



Application/Control Number: 09/855,776 Page 10 

Art Unit: 2183 

a) a plurality of arithmetic units. See Fig. 1, components 24a, 24b, and 26. 

b) a plurality of instruction control units for issuing a series of instructions to drive said 
arithmetic units in a controlled manner. See Fig. 1 and note a first instruction control unit would 
comprise at least components 20a, 22a, and 12, whereas a second unit would comprise at least 
components 20b, 22b, and 12. Each of these units issues instructions to the arithmetic units. 

c) wherein some of said instruction control units are operable to switch between a first execution 
process for synchronous driving said plurality of arithmetic units by a single series of 
instructions and a second execution process independently driving said plurality of arithmetic 
units by a plurality of different series of instructions, respectively. See Fig. 1, component 21 and 
claim 44. Note that the MUX can either select a first series of instruction from bus 36 or a 
second series from bus 35. If the instructions are selected from bus 36, then the system is in 
SIMD mode, in which the arithmetic units are synchronously driven (driven at the same time, 
i.e., in parallel, by single instructions from a single stream). However, it the instructions are 
selected from bus 35, then the system is in MIMD mode in which the arithmetic units are 
independently driven (the units in 24a will be driven by a first instruction stream from fetcher 
20a while units in 24b will be driven by a second instruction stream from fetcher 20b). 

23. Referring to claim 25, Fernando has taught a processor comprising: 

a) a plurality of arithmetic units. See Fig. 1, components 24 and 26. 

b) a single instruction memory for storing a plurality of series of instructions to be executed by a 
plurality of arithmetic units. See Fig.l, components 12 and 26. Fig.7 shows multiple series of 
instructions being executed by different units. 
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c) an instruction decoder for decoding a series of instructions from said instruction memory, and 
outputting a decoded result to any of said plurality of arithmetic units. See Fig. 1, component 22a 
and 22b. 

d) a selector for selectively switching between a plurality of series of instructions from said 
instruction memory to be decoded by said instruction decoder, and supplying a series of 
instructions thus selected to said instruction decoder. See Fig. 1, component 21, and claim 44 of 
Fernando. 

e) wherein said instruction memory has a plurality of ports for issuing said series of instructions 
to said instruction decoder. See Fig. 1 component 12 and note that multiple ports are connected 
to multiple fetch units and ultimately to multiple decoders. 

f) wherein said plurality of series of instructions are issued to said plurality of arithmetic units 
from the instruction memory to be simultaneously and independently driven. Looking at Fig.l, it 
can be seen that when muhiplexer 21 selects instructions from fetcher 20a, then the units 26 in 
components 24a and 24b will be simultaneously driven (by a first instruction stream). This is 
shown more specifically in Fig.3. Note that this is the purpose of SIMD instructions: to have 
multiple units execute the same instruction with multiple data items. On the other hand, looking 
at Fig. 1, it can be seen that when multiplexer 21 selects instructions from fetcher 20b, then the 
units 26 in components 24a and 24b will be independently driven (the units in 24a will be driven 
by a first instruction stream from fetcher 20a while units in 24b will be driven by a second 
instruction stream from fetcher 20b). Note that this is the purpose of MIMD instructions: to have 
multiple units execute the multiple instructions with multiple data items. 
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24. Referring to claim 28, Fernando has taught a processor controlling method usable with a 
plurality of instruction control units for controlling a plurality of arithmetic units to execute a 
pluraUty of series of instructions, said method comprising: 

a) prescribing, in advance in a series of instructions which is to be performed, synchronous 
execution in which a plurality of predetermined ones of said arithmetic units are synchronously 
driven by a single series of instructions, or independent execution in which the plurality of 
predetermined arithmetic units are independently driven by a plurality of respective series of 
instructions. A single series of instructions may be used to drive a plurality of arithmetic units. 
This would occur when the selector 21 shown in Fig. 1 selects the instructions from bus 36. This 
corresponds to Fig.7 at times 10-12 when a single series of instruction (thread 1) is executed by 
two different data paths (arithmetic units), and hence results in the "combined control" illustrated 
in the figure. On the other hand, independent execution of a plurality of series of instructions by 
a plurality of arithmetic units is also shown in Fig.7. For instance, two series (corresponding to 
thread 1 and 2) are independently executed by data paths 1 and 2 (arithmetic units). This would 
occur when MUX 21 in Fig. 1 selects the group of instructions from bus 35. 

b) switching between the synchronous driving and the independent driving of the predetermined 
arithmetic units for performing a series of instructions based on the contents of the prescription 
therein. Clearly from the examples given in Fig, 7, synchronous execution of a single series or 
independent execution of multiple series of instructions is switched back and forth. See Fig.7, 
and notice data paths 1 and 2 from time 1-12. Again, looking at Fig.l, it can be seen that when 
multiplexer 21 selects instructions from fetcher 20a, then the units 26 in components 24a and 24b 
will be simultaneously driven (by a first instruction stream). This is shown more specifically in 
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Fig. 3. Note that this is the purpose of SIMD instructions: to have muhiple units execute the 
same instruction with muhiple data items. On the other hand, looking at Fig. 1, it can be seen that 
when multiplexer 21 selects instructions from fetcher 20b, then the units 26 in components 24a 
and 24b will be independently driven (the units in 24a will be driven by a first instruction stream 
from fetcher 20a while units in 24b will be driven by a second instruction stream from fetcher 
20b), Note that this is the purpose of MIMD instructions: to have multiple units execute the 
multiple instructions with multiple data items. 

25. Referring to claim 29, Fernando has taught a processor control method to control a 
plurality of arithmetic units connected with a plurality of instruction control units, comprising: 
switching between simultaneously driving the plurality of arithmetic units by issuing a single 
series of instructions from one of the plurality of instruction control units and independently 
driving each of the plurality of arithmetic units by correspondingly issuing series of instructions 
from the plurality of instruction control units, wherein the switching is performed based on 
contents of processes to be executed. See Fig. 1, component 21 and note that either a first series 
of instructions along bus 36 may be selected for decoding or a second series of instructions from 
bus 35 may be selected for decoding. When the first series of instructions along bus 36 is 
selected, the system is in SEVCD mode (Fig. 3), where the plurality of arithmetic units 
(components 26 in component 24b) are driven at the same time as a second group of arithmetic 
units (components 26 in component 24a) by the same stream of instructions. When the second 
series of instructions along bus 35 is selected, the system is in SIMD mode (Fig. 4), where each 
of the plurality of arithmetic units (components 26 in component 24b) are driven independently 
of a second group of arithmetic units (components 26 in component 24a) by different streams of 
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instructions. Also, from Fig. 5, it should be noted that the switching between modes is done 
based on instructions (contents) of the processes. For instance, when a cfork instruction is 
encountered, SIMD mode is entered where the arithmetic units are simultaneously driven by a 
single stream whereas when a dfork is encountered, the arithmetic units are independently driven 
by multiple instruction streams. 

Claim Rejections - 35 USC §103 

26. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set forth in 
section 102 of this title, if the differences between the subject matter sought to be patented and the prior art are 
such that the subject matter as a whole would have been obvious at the time the invention was made to a person 
having ordinary skill in the art to which said subject matter pertains. Patentability shall not be negatived by the 
manner in which the invention was made. 

27. Claim 7 is rejected under 35 U.S.C. 103(a) as being unpatentable over Parady, as apphed 
above, in view ofDowling, U.S. Patent No. 6,170,051. 

28. Referring to claim 7, Parady has taught a processor control apparatus as described in 
claim 1 , Parady has not taught that each of said series of instructions includes a VLIW type 
instruction. However, Dowling has taught series of instructions that comprise VLIW 
instructions. See the abstract. As is known in the art, and explained in column 1, lines 18-29, 
and lines 45-51, of Dowling, VLIW instructions allow for the utihzation of muhiple functional 
units in the same cycle, as opposed to just one per cycle. This (instruction level parallelism), in 
turn, increases throughput. One of ordinary skill in the art would have recognized that VLIW 
instructions would be beneficial and implementable in Parady because Parady has taught 
multiple functional units, which when used in conjunction with VLIW instructions, would be 
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utilized more efficiently in order to increase throughput. Consequently, it would have been 
obvious to one of ordinary skill in the art at the time of the invention to modify Parady to include 
VLIW instructions. 

29. Claim 9 is rejected under 35 U.S.C. 103(a) as being unpatentable over Parady, as applied 
above, in view of Fernando, as applied above. 

30. Referring to claim 9, Parady has taught a processor control apparatus as described in 
claim 1 . Parady has not taught power control elements for controlling power supply to said 
arithmetic units based on their instruction executing states. However, Fernando has taught such 
a concept. See the abstract and column 8, lines 10-12. Note that when a particular functional 
unit is not in use, it will be powered down, thereby reducing the overall power consumption of 
the system. As is seen in Fig. 1 of Parady, many functional units exist which can execute 
instructions. A person of ordinary skill in the art would have recognized that by implementing 
the power-saving concept of Fernando into the system of Parady, power consumption would be 
reduced by powering down those functional units which are not in use. As a result, it would 
have been obvious to one of ordinary skill in the art at the time of the invention to modify Parady 
to include the powering mechanism taught by Fernando, 

31. Claims 10-12, 15-16, and 24 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Fernando, as applied above. 

32. Referring to claim 10, Fernando has taught a processor control apparatus comprising: 
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a) a single instruction memory for storing a plurality of series of instructions and supplying them 
to arithmetic units. See Fig. 1, components 12 and 26. Fernando has not taught a plurality of 
instruction memories for storing a plurality of series of instructions to be executed by a plurality 
of arithmetic units. However, Official Notice is taken that having a plurality of memories for 
storing independent series of instructions is well known and accepted in the art. In addition, as 
shown in Nerwin v. Erhchman . 168 USPQ 177 (1969), to make separable is generally not given 
patentable weight or would have been an obvious improvement. For instance, a person of 
ordinary skill in the art would have recognized that by implementing a plurality of memories 
instead of a single memory with concurrent access capabilities, as taught by Fernando (evident in 
Fig. 7 where multiple streams are fetched at once), then the circuitry required to allow for 
concurrent access would be eliminated, thereby reducing the complexity and cost of the system. 
Consequently, it would have been obvious to replace Fernando's single instruction memory with 
a plurality of instruction memories. 

b) an instruction decoder for decoding a series of instructions from said instruction memories, 
and outputting a decoded result to any of said plurality of arithmetic units. See Fig. 1, component 
22a, 22b, and 26. 

c) and a selector for selectively switching between a plurality of series of instructions from said 
instruction memories to be decoded by said instruction decoder, and supplying a series of 
instructions thus selected to said instruction decoder. See Fig.l, component 21, and claim 44 of 
Fernando. 

d) wherein said plurality of series of instructions are issued to said plurality of arithmetic units 
from one of the plurality of instruction memories or are issued respectively from said plurality of 
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instruction memories to enable said plurality of arithmetic units to be simultaneously and 
independently driven. It should be noted that it is inherent that a plurality of series (groups) of 
instructions are issued to arithmetic units from one of the instruction memories. Clearly, all 
instructions originate in instruction memory. Furthermore, applicant has used an "or" clause 
which means the examiner only needs to show the teaching of one of the limitations separated by 
the "or" clause. 

33. Referring to claim 11, Fernando has taught a processor control apparatus as described in 
claim 10. Fernando has further taught that some of said plurality of series of instructions contain 
information about selective switching between said series of instructions to be performed by said 
selector, and wherein said instruction decoder decodes said information contained in a series of 
instructions, and outputs a switching instruction to said selector. See column 6, lines 41-54, and 
column 7, lines 17-29, and note that when a CFORK or DFORK is detected by decoder 22a, a 
signal 32i (Fig. 1) is generated, which then causes the muhiplexer 21 (Fig. 1) to select another 
series of instructions, which will then be decoded. 

34. Referring to claim 12, Fernando has taught a processor control apparatus as described in 
claim 10, Fernando has further taught that some of said plurality of series of instructions contain 
a synchronizing instruction for allowing a first predetermined one of said arithmetic units and a 
second predetermined arithmetic unit to synchronously perform processes, and wherein when 
said synchronizing instruction is issued to said first predetermined arithmetic unit, said first 
predetermined arithmetic unit is set in a wait state, and an instruction decoder of said second 
predetermined arithmetic unit does not output a switching instruction to its associated selector if 
a process is being executed by said second predetermined arithmetic unit upon issuance of said 
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synchronizing instruction, and does not release the wait state of said first predetermined 
arithmetic unit until said second predetermined arithmetic unit completes said process. See 
column 7, lines 52-67. Note that a WAIT instruction will pause the first arithmetic unit until a 
DJOIN instruction is executed by the second arithmetic unit (synchronization). Furthermore, the 
stream will not be switched as the second arithmetic unit will continue executing the same thread 
until it is finished (upon a DJOIN instruction). See Fig. 7. 

35. Referring to claim 15, Fernando has taught a processor control apparatus as described in 
claim 10. Fernando has further taught that each of said series of instructions includes a series of 
time-sharing instructions for serially driving a plurality of ones of said arithmetic units. For 
instance, looking at Fig. 7 of Fernando, the series of instructions corresponding to thread 2 
execute from time 2 to time 7. Therefore, the thread takes 6 time units to execute and each 
individual instruction requires some portion of the overall 6 time units for execution. Therefore, 
each instruction is a time-sharing instruction, i.e., each instruction shares the overall 6 time units 
with the other instructions. 

36. Referring to claim 16, Fernando has taught a processor control apparatus as described in 
claim 10. Fernando has further taught power control elements for controlling power supply to 
said arithmetic units based on their instruction executing states. See the abstract and column 8, 
lines 10-12, and note that unused processing elements are deactivated in order to conserve 
power. 

37. Referring to claim 24, Fernando has taught a processor comprising: 
a) a plurality of arithmetic units. See Fig, 1, components 24 and 26. 
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b) a single instruction memory for storing a plurality of series of instructions and supplying them 
to arithmetic units. See Fig. 1, components 12. Fernando has not taught a plurality of instruction 
memories for storing a plurality of series of instructions to be executed by a plurality of 
arithmetic units. However, Official Notice is taken that having a plurality of memories for 
storing independent series of instructions is well known and accepted in the art. In addition, as 
shown in Nerwin v. Erlichman . 168 USPQ 177 (1969), to make separable is generally not given 
patentable weight or would have been an obvious improvement. For instance, a person of 
ordinary skill in the art would have recognized that by implementing a plurality of memories 
instead of a single memory with concurrent access capabilities, as taught by Fernando (evident in 
Fig. 7 where multiple streams are fetched at once), then the circuitry required to allow for 
concurrent access would be eliminated, thereby reducing the complexity and cost of the system. 
Consequently, it would have been obvious to replace Fernando' s single instruction memory with 
a plurality of instruction memories. 

c) an instruction decoder for decoding a series of instructions from said instruction memories, 
and outputting a decoded result to any of said plurality of arithmetic units. See Fig. 1, component 
22a and 22b. 

d) and a selector for selectively switching between a plurality of series of instructions from said 
instruction memories to be decoded by said instruction decoder, and supplying a series of 
instructions thus selected to said instruction decoder. See Fig.l, component 21, and claim 44 of 
Fernando. Note that the MUX can either select a first series of instruction from bus 36 or a 
second series from bus 35. 
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e) wherein said plurality of series of instructions are issued to said plurality of arithmetic units 
from one of the plurality of instruction memories or are issued respectively from said plurality of 
instruction memories to enable said plurality of arithmetic units to be simultaneously and 
independently driven. It should be noted that it is inherent that a plurality of series (groups) of 
instructions are issued to arithmetic units from one of the instruction memories. Clearly, all 
instructions originate in instruction memory. Furthermore, applicant has used an "or" clause 
which means the examiner only needs to show the teaching of one of the Umitations separated by 
the "or" clause. 

38. Claim 13 is rejected under 35 U.S.C. 103(a) as being unpatentable over Fernando, as 
applied above, in view of Parady, as applied above. 

39. Referring to claim 13, Fernando has taught a processor control apparatus as described in 
claim 10. 

a) Fernando has not explicitly taught an instruction queue for temporarily storing, at a stage prior 
to said selector, a series of instructions to be transmitted from a second one of said instruction 
memories different from a first one of said instruction memories which stores a series of 
instructions being executed by said first predetermined arithmetic unit. However, Parady has 
taught employing muUiple instruction queues for holding different series of instructions before 
one of them is selected. See Fig,3, components 102-108. A person of ordinary skill in the art 
would have recognized that an instruction queue allows a system to hold a series of instructions 
within the system. Instructions from the queue will be retrieved faster than if they had to be 
retrieved from main memory since main memory is slower than an on-chip memory, such as 
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Parady's queues. Consequently, it would have been obvious to one of ordinary skill in the art at 
the time of the invention to modify Fernando to include instruction queues on-chip to hold series 
of instruction so that the instructions are retrieved faster than they would from main memory, 
b) and a determiner for determining, based on a series of instructions being executed, whether or 
not the process being performed by said first predetermined arithmetic unit can be interrupted, 
said determiner operating to output, if the process can be interrupted, an interrupt signal for 
interrupting the issuance of the series of instructions to said first instruction memory which is a 
source of the series of instructions being executed, and generate a switching instruction to said 
selector to switch to a series of instructions from said instruction queue. Looking at Fig. 1, it 
should be realized that if the first arithmetic unit 24b is executing a stream of instructions from 
bus 36 and a DFORK instruction is encountered by decoder 22a, then the MUX for the first 
arithmetic unit will be configured such that the execution of a first stream of instructions is 
interrupted in order to execute a second stream of instructions from bus 35. See column 5, lines 
17-25, and column 7, lines 17-29. So, in essence, Fernando has taught interrupting a first 
process in order to execute a second process. 

40. Claims 14, 18-22, and 26-27 are rejected under 35 U.S.C. 103(a) as being unpatentable 
over Fernando, as apphed above, in view of Dowling, as applied above. 

41 . Referring to claim 14, Fernando has taught a processor control apparatus as described in 
claim 10. Fernando has not taught that each of said series of instructions includes a VLIW type 
instruction. However, Dowling has taught series of instructions that comprise VLIW 
instructions. See the abstract. As is known in the art, and explained in column 1, lines 18-29, 
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and lines 45-51, of Dowling, VLIW instructions allow for the utilization of multiple functional 
units in the same cycle, as opposed to just one per cycle. This (instruction level parallelism), in 
turn, increases throughput. One of ordinary skill in the art would have recognized that VLIW 
instructions would be beneficial and implementable in Fernando because Fernando has taught 
multiple functional units (Fig.l, components 26), which when used in conjunction with VLIW 
instructions, would be utilized more efficiently in order to increase throughput. Consequently, it 
would have been obvious to one of ordinary skill in the art at the time of the invention to modify 
Fernando to include VLIW instructions. 

42, Referring to claim 18, Fernando has taught a processor control apparatus for controlling a 
plurality of arithmetic units (Fig. 1, components 24a and 24b), said processor control apparatus 
comprising a plurality of instruction control units for instructing said arithmetic units to execute 
a series of instructions, wherein each of said instruction control units includes: 
a) a single instruction memory for storing a plurality of series of instructions and supplying them 
to arithmetic units. See Fig. 1, components 12 and 26. Fernando has not taught that each 
instruction control unit includes an instruction memory for storing a plurality of series of 
instructions, i.e. a plurality of instruction memories. However, Official Notice is taken that 
having a plurality of memories for storing series of instructions is well known and accepted in 
the art. In addition, as shown in Nerwin v. ErUchmaa 168 USPQ 177 (1969), to make separable 
is generally not given patentable weight or would have been an obvious improvement. For 
instance, a person of ordinary skill in the art would have recognized that by implementing a 
plurality of memories instead of a single memory with concurrent access capabilities, as taught 
by Fernando (evident in Fig. 7 where muUiple streams are fetched at once), then the circuitry 
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required to allow for concurrent access would be eliminated, thereby reducing the complexity 
and cost of the system. Consequently, it would have been obvious to replace Fernando' s single 
instruction memory with a plurality of instruction memories. 

b) an instruction decoder for decoding a series of instructions and supplying the decoded series 
of instructions to an associated one of said arithmetic units. See Fig. 1, components 22a and 22b 
(note that the control units may each comprise at least a decoder, fetch unit, and instruction 
memory). 

c) wherein some of said instruction control units each have an instruction control selector for 
selectively switching between a first series of instructions from a first instruction memory of one 
of said instruction control units simultaneously driving the plurality of arithmetic units and a 
second series of instructions from a second instruction memory of another instruction control 
unit different from said one instruction control unit independently driving each of the plurality of 
arithmetic units to output one of said first and second series of instructions thus selected to said 
instruction decoder See Fig. 1, component 21 and note that either a first series of instructions 
along bus 36 may be selected for decoding or a second series of instructions from bus 35 may be 
selected for decoding. When the first series of instructions along bus 36 is selected, the system is 
in SEVDD mode (Fig. 3), where the plurality of arithmetic units (components 26 in component 
24b) are driven at the same time as a second group of arithmetic units (components 26 in 
component 24a) by the same stream of instructions. When the second series of instructions 
along bus 35 is selected, the system is in SIMD mode (Fig.4), where each of the plurality of 
arithmetic units (components 26 in component 24b) are driven independently of a second group 
of arithmetic units (components 26 in component 24a) by different streams of instructions. 
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d) wherein each of said arithmetic units includes a register file (say 25 b) for storing data during 
execution and the ability to access the register file of another arithmetic unit (25a) via bus 38 by 
using MOVE instructions. See column 6, lines 16-32. Fernando has not taught that each 
arithmetic unit includes a first register file and a second register file for storing data generated by 
said first and second series of instructions, respectively, which are supplied from said first and 
second instruction memories and decoded by an instruction decoder of an associated one of said 
instruction control units, and an arithmetic unit selector for selectively switching between said 
data generated by said first and second series of instructions being executed and stored in said 
first and second register files, respectively, according to an instruction fi-om said associated 
instruction decoder to supply a selected one of said first and second series of instructions to a 
calculator.. However, Dowling has taught such a concept. Note from Fig.2, that arithmetic unit 
200 includes two register sets 160 and 260 for use in executing two different instruction streams 
130 and 230. Based on the stream, one of the register sets is selected by selector 290. By adding 
a second set of registers to the arithmetic unit, according to column 9, lines 59-62, the processing 
hardware can quickly switch both internal context and external context, i.e. quickly switch 
execution from one stream of instructions to another, as Fernando has taught. A person of 
ordinary skill in the art would have also recognized that by having a second register set in an 
arithmetic unit, the MOVE instruction would not be required to move data fi'om one set to 
another in order to facilitate execution of another stream. This would also reduce the complexity 
of the system. Therefore, in order to speed up the system and reduce complexity, it would have 
been obvious to one of ordinary skill in the art at the time of the invention to modify Fernando 
such that each arithmetic unit includes a second register set. 
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43. Referring to claim 19, Fernando in view of Dowling has taught a processor control 
apparatus as described in claim 18. Fernando has not taught that each of said series of 
instructions includes a VLIW type instruction. However, Dowling has taught series of 
instructions that comprise VLIW instructions. See the abstract. As is known in the art, and 
explained in column 1, lines 18-29, and lines 45-51, of DowHng, VLIW instructions allow for 
the utilization of multiple functional units in the same cycle, as opposed to just one per cycle. 
This (instruction level parallelism), in turn, increases throughput. One of ordinary skill in the art 
would have recognized that VLIW instructions would be beneficial and implementable in 
Fernando because Fernando has taught multiple functional units, which when used in 
conjunction with VLIW instructions, would be utilized more efficiently in order to increase 
throughput. Consequently, it would have been obvious to one of ordinary skill in the art at the 
time of the invention to modify Fernando to include VLIW instructions. 

44. Referring to claim 20, Fernando in view of DowUng has taught a processor control 
apparatus as described in claim 18. Fernando has further taught that each of said series of 
instructions includes a series of time-sharing instructions for serially driving a plurality of ones 
of said arithmetic units. For instance, looking at Fig. 7 of Fernando, the series of instructions 
corresponding to thread 2 execute fi*om time 2 to time 7. Therefore, the thread takes 6 time units 
to execute and each individual instruction requires some portion of the overall 6 time units for 
execution. Therefore, each instruction is a time-sharing instruction, i.e., each instruction shares 
the overall 6 time units with the other instructions. 

45. Referring to claim 21, Fernando in view of Dowling has taught a processor control 
apparatus as described in claim 18. Fernando has further taught power control elements for 
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controlling power supply to said arithmetic units based on their instruction executing states. See 
the abstract and column 8, lines 10-12, and note that unused processing elements are deactivated 
in order to conserve power. 

46. Referring to claim 22, Fernando has taught a processor control apparatus for controlling a 
plurality of arithmetic units (Fig.l, components 24 and 26), said processor control apparatus 
comprising a plurality of instruction control units (Fig.l) for instructing said arithmetic units to 
execute a series of instructions. 

a) wherein said instruction control units have a single instruction memory used in common for 
storing a plurality of series of instructions (see Fig.l, component 12), and each includes an 
instruction decoder for decoding a series of instructions and supplying the decoded series of 
instructions to an associated one of said arithmetic units (see Fig.l, components 22a and 22b, and 
note that the control units may each comprise at least a decoder, fetch unit, and instruction 
memory), and said single instruction memory has a plurality of ports for issuing said series of 
instructions to said respective instruction decoders (See Fig.l component 12 and note that 
multiple ports are connected to multiple fetch units and ultimately to multiple decoders). 

b) wherein some of said instruction control units each have an instruction control selector for 
selectively switching between a first series of instructions simultaneously driving the plurality of 
arithmetic units and a second series of instructions independently driving each of the plurality of 
arithmetic units both from said single instruction memory to output one of said first and second 
series of instructions thus selected to said instruction decoder. See Fig. 1, component 21 and note 
that either a first series of instructions along bus 36 may be selected for decoding or a second 
series of instructions from bus 35 may be selected for decoding. When the first series of 
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instructions along bus 36 is selected, the system is in SIMD mode (Fig.3), where the plurality of 
arithmetic units (components 26 in component 24b) are driven at the same time as a second 
group of arithmetic units (components 26 in component 24a) by the same stream of instructions. 
When the second series of instructions along bus 35 is selected, the system is in SEVED mode 
(Fig. 4), where each of the plurality of arithmetic units (components 26 in component 24b) are 
driven independently of a second group of arithmetic units (components 26 in component 24a) 
by different streams of instructions. 

c) wherein each of said arithmetic units includes a register file (say 25b) for storing data during 
execution and the ability to access the register file of another arithmetic unit (25a) via bus 38 by 
using MOVE instructions. See column 6, lines 16-32. Fernando has not taught that each 
arithmetic unit includes a first register file and a second register file for storing data generated by 
said first and second series of instructions, respectively, which are supplied from said first and 
second instruction memories and decoded by an instruction decoder of an associated one of said 
instruction control units, and an arithmetic unit selector for selectively switching between said 
data generated by said first and second series of instructions being executed and stored in said 
first and second register files, respectively, according to an instruction from said associated 
instruction decoder to supply a selected one of said first and second series of instructions to a 
calculator.. However, Dowling has taught such a concept. Note fi'om Fig.2, that arithmetic unit 
200 includes two register sets 160 and 260 for use in executing two different instruction streams 
130 and 230. Based on the stream, one of the register sets is selected by selector 290. By adding 
a second set of registers to the arithmetic unit, according to column 9, lines 59-62, the . processing 
hardware can quickly switch both internal context and external context, i.e. quickly switch 
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execution from one stream of instructions to another, as Fernando has taught. A person of 
ordinary skill in the art would have also recognized that by having a second register set in an 
arithmetic unit, the MOVE instruction would not be required to move data from one set to 
another in order to facilitate execution of another stream. This would also reduce the complexity 
of the system. Therefore, in order to speed up the system and reduce complexity, it would have 
been obvious to one of ordinary skill in the art at the time of the invention to modify Fernando 
such that each arithmetic unit includes a second register set. 
47. Referring to claim 26, Fernando has taught a processor comprising: 

a) a plurality of arithmetic units (Fig.l, components 24 and 26). 

b) a plurality of instruction control units (Fig.l) for driving said arithmetic units in a controlled 
manner, wherein each of said instruction control units includes: 

c) a single instruction memory for storing a plurality of series of instructions and supplying them 
to arithmetic units. See Fig. 1, components 12 and 26. Fernando has not taught that each 
instruction control unit includes an instruction memory for storing a plurality of series of 
instructions, i.e. a plurality of instruction memories. However, Official Notice is taken that 
having a plurality of memories for storing series of instructions is well known and accepted in 
the art. In addition, as shown in Nerwin v. Erlichman , 168 USPQ 177 (1969), to make separable 
is generally not given patentable weight or would have been an obvious improvement. For 
instance, a person of ordinary skill in the art would have recognized that by implementing a 
plurality of memories instead of a single memory with concurrent access capabilities, as taught 
by Fernando (evident in Fig. 7 where multiple streams are fetched at once), then the circuitry 
required to allow for concurrent access would be eliminated, thereby reducing the complexity 



Application/Control Number: 09/855,776 Page 29 

Art Unit: 2183 

and cost of the system. Consequently, it would have been obvious to replace Fernando 's single 
instruction memory with a plurality of instruction memories. 

d) an instruction decoder for decoding a series of instructions and supplying the decoded series 
of instructions to an associated one of said arithmetic units. See Fig.l, components 22a and 22b 
(note that the control units may each comprise at least a decoder, fetch unit, and instruction 
memory). 

e) wherein some of said instruction control units each have an instruction control selector for 
selectively switching between a first series of instructions fi-om a first instruction memory of one 
of said instruction control units simultaneously driving the plurality of arithmetic units and a 
second series of instructions from a second instruction memory of another instruction control 
unit different from said one instruction control unit independently driving each of the plurality of 
arithmetic units to output one of said first and second series of instructions thus selected to said 
instruction decoder, based on contents of processes to be executed. See Fig. 1, component 21 and 
note that either a first series of instructions along bus 36 may be selected for decoding or a 
second series of instructions from bus 35 may be selected for decoding. When the first series of 
instructions along bus 36 is selected, the system is in SIMD mode (Fig.3), where the plurality of 
arithmetic units (components 26 in component 24b) are driven at the same time as a second 
group of arithmetic units (components 26 in component 24a) by the same stream of instructions. 
When the second series of instructions along bus 35 is selected, the system is in SIMD mode 
(Fig. 4), where each of the plurality of arithmetic units (components 26 in component 24b) are 
driven independently of a second group of arithmetic units (components 26 in component 24a) 
by different streams of instructions. Also, from Fig. 5, it should be noted that the switching 
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between modes is done based on instructions (contents) of the processes. For instance, when a 
cfork instruction is encountered, SIMD mode is entered where the arithmetic units are 
simuhaneously driven by a single stream whereas when a dfork is encountered, the arithmetic 
units are independently driven by multiple instruction streams. 

f) wherein each of said arithmetic units includes a register file (say 25b) for storing data during 
execution and the ability to access the register file of another arithmetic unit (25a) via bus 38 by 
using MOVE instructions. See column 6, lines 16-32. Fernando has not taught that each 
arithmetic unit includes a first register file and a second register file for storing data generated by 
said first and second series of instructions, respectively, which are supplied from said first and 
second instruction memories and decoded by an instruction decoder of an associated one of said 
instruction control units, and an arithmetic unit selector for selectively switching between said 
data generated by said first and second series of instructions being executed and stored in said 
first and second register files, respectively, according to an instruction from said associated 
instruction decoder to supply a selected one of said first and second series of instructions to a 
calculator.. However, Dowling has taught such a concept. Note fi'om Fig. 2, that arithmetic unit 
200 includes two register sets 160 and 260 for use in executing two different instruction streams 
130 and 230. Based on the stream, one of the register sets is selected by selector 290. By adding 
a second set of registers to the arithmetic unit, according to column 9, lines 59-62, the processing 
hardware can quickly switch both internal context and external context, i.e. quickly switch 
execution from one stream of instructions to another, as Fernando has taught. A person of 
ordinary skill in the art would have also recognized that by having a second register set in an 
arithmetic unit, the MOVE instruction would not be required to move data fi-om one set to 
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another in order to facilitate execution of another stream. This would also reduce the complexity 
of the system. Therefore, in order to speed up the system and reduce complexity, it would have 
been obvious to one of ordinary skill in the art at the time of the invention to modify Fernando 
such that each arithmetic unit includes a second register set. 
48. Referring to claim 27, Fernando has taught a processor comprising: 

a) a plurality of arithmetic units (Fig.l, components 24 and 26). 

b) a plurality of instruction control units (Fig. 1) for driving said arithmetic units in a controlled 
manner. 

c) wherein said instruction control units have a single instruction memory used in common for 
storing a plurality of series of instructions (see Fig.l, component 12), and each includes an 
instruction decoder for decoding a series of instructions and supplying the decoded series of 
instructions to an associated one of said arithmetic units (see Fig.l, components 22a and 22b, and 
note that the control units may each comprise at least a decoder, fetch unit, and instruction 
memory), and said single instruction memory has a plurality of ports for issuing said series of 
instructions to said respective instruction decoders (See Fig.l component 12 and note that 
multiple ports are connected to multiple fetch units and ultimately to multiple decoders). 

e) wherein some of said instruction control units each have an instruction control selector for 
selectively switching between a first series of instructions simultaneously driving the plurality of 
arithmetic units and a second series of instructions independently driving each of the plurality of 
arithmetic units both from said single instruction memory to output one of said first and second 
series of instructions thus selected to said instruction decoder. See Fig. 1, component 21 and note 
that either a first series of instructions along bus 36 may be selected for decoding or a second 
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series of instructions from bus 35 may be selected for decoding. When the first series of 
instructions along bus 36 is selected, the system is in SIMD mode (Fig.3), where the plurality of 
arithmetic units (components 26 in component 24b) are driven at the same time as a second 
group of arithmetic units (components 26 in component 24a) by the same stream of instructions. 
When the second series of instructions along bus 35 is selected, the system is in SIMD mode 
(Fig.4), where each of the plurality of arithmetic units (components 26 in component 24b) are 
driven independently of a second group of arithmetic units (components 26 in component 24a) 
by different streams of instructions. Also, from Fig.5, it should be noted that the switching 
between modes is done based on instructions (contents) of the processes. For instance, when a 
cfork instruction is encountered, SIMD mode is entered where the arithmetic units are 
simultaneously driven by a single stream whereas when a dfork is encountered, the arithmetic 
units are independently driven by multiple instruction streams. 

f) wherein each of said arithmetic units includes a register file (say 25b) for storing data during 
execution and the ability to access the register file of another arithmetic unit (25a) via bus 38 by 
using MOVE instructions. See column 6, hnes 16-32. Fernando has not taught that each 
arithmetic unit includes a first register file and a second register file for storing data generated by 
said first and second series of instructions, respectively, which are suppUed fi-om said first and 
second instruction memories and decoded by an instruction decoder of an associated one of said 
instruction control units, and an arithmetic unit selector for selectively switching between said 
data generated by said first and second series of instructions being executed and stored in said 
first and second register files, respectively, according to an instruction fi'om said associated 
instruction decoder to supply a selected one of said first and second series of instructions to a 
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calculator.. However, Dowling has taught such a concept. Note from Fig. 2, that arithmetic unit 
200 includes two register sets 160 and 260 for use in executing two different instruction streams 
130 and 230, Based on the stream, one of the register sets is selected by selector 290. By adding 
a second set of registers to the arithmetic unit, according to column 9, lines 59-62, the processing 
hardware can quickly switch both internal context and external context, i.e. quickly switch 
execution from one stream of instructions to another, as Fernando has taught. A person of 
ordinary skill in the art would have also recognized that by having a second register set in an 
arithmetic unit, the MOVE instruction would not be required to move data from one set to 
another in order to facilitate execution of another stream. This would also reduce the complexity 
of the system. Therefore, in order to speed up the system and reduce complexity, it would have 
been obvious to one of ordinary skill in the art at the time of the invention to modify Fernando 
such that each arithmetic unit includes a second register set. 

Response to Arguments 

49. Applicant's arguments filed on March 1 1, 2005 have been fully considered but they are 
not persuasive. 

50. In general, applicant argues that the prior art of record does not teach switching between 
a first series of instructions for simultaneously driving a plurality of arithmetic units and a 
second series of instructions for independently driving the plurality of arithmetic units, 

5 1 . These arguments are not found persuasive for the reasons set forth in the rejections of the 
claims above. However, in summation, looking at Fig.l, component 21, notice that either a first 
series of instructions along bus 36 may be selected for decoding or a second series of instructions 
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from bus 35 may be selected for decoding. When the first series of instructions along bus 36 is 
selected, the system is in SIMD mode (Fig,3), where the plurality of arithmetic units 
(components 26 in component 24b) are driven at the same time as a second group of arithmetic 
units (components 26 in component 24a) by the same stream of instructions. When the second 
series of instructions along bus 35 is selected, the system is in SEMD mode (Fig.4), where each 
of the plurality of arithmetic units (components 26 in component 24b) are driven independently 
of a second group of arithmetic units (components 26 in component 24a) by different streams of 
instructions. Also, from Fig. 5, it should be noted that the switching between modes is done 
based on instructions (contents) of the processes. For instance, when a cfork instruction is 
encountered, SDVDD mode is entered where the arithmetic units are simultaneously driven by a 
single stream whereas when a dfork is encountered, the arithmetic units are independently driven 
by multiple instruction streams. Furthermore, applicants argue that their system switches 
between SIMD (simultaneous) and VLIW (independent) mode, however, this limitation is not 
within the claims. 

Conclusion 

52. THIS ACTION IS MADE FINAL. Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within TWO 
MONTHS of the mailing date of this final action and the advisory action is not mailed until after 
the end of the THREE-MONTH shortened statutory period, then the shortened statutory period 
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will expire on the date the advisory action is mailed, and any extension fee pursuant to 37 
CFR 1 .136(a) will be calculated from the mailing date of the advisory action. In no event, 
however, will the statutory period for reply expire later than SIX MONTHS from the mailing 
date of this final action. 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to David J, Huisman whose telephone number is (571) 272-4168. 
The examiner can normally be reached on Monday-Friday (8:00-4:30). 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Eddie Chan can be reached on (571) 272-4162. The fax phone number for the 
organization where this application or proceeding is assigned is 703-872-9306. 

Information regarding the status of an application may be obtained from the Patent 
Application Information Retrieval (PAIR) system. Status information for published applications 
may be obtained from either Private PAIR or PubUc PAIR. Status information for unpublished 
applications is available through Private PAIR only. For more information about the PAIR 
system, see http://pair-direct.uspto.gov. Should you have questions on access to the Private PAIR 
system, contact the Electronic Business Center (EEC) at 866-217-9197 (toll-free). 



DJH 

David J. Huisman 
May 11, 2005 




